Fix #3548: Match MathTex subscripts/superscripts by position #4457

Nikhil172913832 · 2025-10-25T10:51:42Z

Overview: What does this pull request change?

Fixes issue where MathTex submobjects did not correctly correspond to their tex_strings when using subscripts and superscripts in different orders. The fix uses geometric position matching specifically for script elements (^, _) to handle LaTeX's reordering while preserving sequential matching for non-script elements.

Motivation and Explanation: Why and how do your changes improve the library?

Problem: LaTeX compiles expressions like A ^n _1 and A _1 ^n to identical SVG output where subscripts and superscripts may appear in a different order than specified. This caused MathTex('A', '^n', '_1') and MathTex('A', '_1', '^n') to have submobjects that didn't match their original tex_strings, breaking operations like get_parts_by_tex() and set_color_by_tex().

Solution: Modified _break_up_by_substrings() method to detect script elements (tex strings starting with ^ or _) and match them to rendered submobjects based on geometric position (center point). Non-script elements continue using sequential matching to maintain backward compatibility and avoid issues with complex formulas.

Impact: Users can now reliably access and manipulate subscripts/superscripts by their tex strings regardless of the order they're specified.

Links to added or changed documentation pages

No documentation changes required.

Further Information and Comments

Fixes Rendered submobjects of MathTex may not correspond to their tex strings #3548
All existing tests pass (20/20 in test_texmobject.py)
Added regression test: test_tex_strings_with_subscripts_and_superscripts()

Reviewer Checklist

The PR title is descriptive enough for the changelog, and the PR is labeled correctly
If applicable: newly added non-private functions and classes have a docstring including a short summary and a PARAMETERS section
If applicable: newly added functions and classes are tested

…ition Use geometric matching for script elements to handle LaTeX reordering while preserving sequential matching for non-script elements.

for more information, see https://pre-commit.ci

henrikmidtiby · 2025-10-31T21:04:01Z

Thanks for the PR.

I have tried to run your new test without the suggested changes to the tex_mobject.py file.
I would expect the test to reveal an issue with the matching process.
But on my computer the test passes without any issues...

Similarly I have tested the full PR on the two scenes (Minimal and MinimalWithSum) reported in issue #3548. Both scenes still fail even with the changes from this PR active.
Do you get similar results?

…tch by Y position

Nikhil172913832 · 2025-11-01T04:51:15Z

@henrikmidtiby Thanks for pointing that out. I had initially overlooked the fix, and since the test passed, I missed verifying whether the original issue was actually resolved. I’ve now updated the test and revised my approach. The test correctly fails on the main branch now.

for more information, see https://pre-commit.ci

henrikmidtiby · 2025-11-03T17:52:22Z

Good progress.

I have tried to apply the current PR to the following test case (from #3548)

from manim import *

class MinimalWithSum(Scene):
    def construct(self):
        """ This shows that substring may not correspond to tex shape """
        t2cm = {'\sum': BLUE, '^n': RED, '_1': GREEN, 'x':YELLOW}
        eq1 = MathTex('\sum', '^n', '_1', 'x', tex_to_color_map=t2cm)
        eq2 = MathTex('\sum', '_1', '^n', 'x', tex_to_color_map=t2cm)

        font = {'font_size': 24}
        txts = [Text(sub.get_tex_string(), t2c=t2cm, **font) for sub in (eq1, eq2) for i in range(len(sub))]
        txt1 = VGroup(*txts[:4])
        txt2 = VGroup(*txts[4:])

        cap1 = Text('tex rendered', **font)
        cap2 = Text('tex substrings', **font)
        
        grp = VGroup(cap1, cap2, eq1, txt1, eq2, txt2).arrange_in_grid(3,2)
        grp.scale(2).move_to(ORIGIN)
        self.add(grp)

Which renders as shown here.

Which is more consistent than if I render the scene using the current main branch, that produces this output.

I still think that the coloring is off in both cases, as I would expect the summation signs to be blue.

In addition I wonder if it is possible to extract some of the functionality into a separate method. The intention here is to make it easier to understand what the code is actually doing.
Prior to this PR I had to pay close attention to understand the 26 lines of code in the _break_up_by_substrings method. The method is now close to 100 lines and I haven't yet managed to really understand what is happening (e.g. why should the order of the sorted_pool be reversed in some cases).

Nikhil172913832 · 2025-11-04T06:52:57Z

@henrikmidtiby does it look correct now?

henrikmidtiby · 2025-11-04T07:20:08Z

@Nikhil172913832 Much better!
This is exactly what I would expect from reading the code for the MinimalWithSum scene.

…_break_up_by_substrings function to improve understandability

for more information, see https://pre-commit.ci

Nikhil172913832 · 2025-11-06T08:19:59Z

@henrikmidtiby I’ve made the necessary changes related to the colors. Please let me know if everything looks good.

henrikmidtiby · 2025-11-06T22:32:30Z

Nice to see your progress on this.
I have attempted to find an example where the colors of the parts of the MathTex is assigned in an unwanted way. Until now I haven't been successful at that. However I have found this example, where parts of the extracted tex strings seems to duplicated in certain conditions.

from manim import *

class MinimalWithSumDifficult(Scene):
    def construct(self):
        """ This shows that substring may not correspond to tex shape """
        t2cm = {'\sum': BLUE, '^n': RED, '_1': GREEN, 'x':YELLOW}
        eq1 = MathTex(r'\sum', '^n', '_1', 'x', '^2', '= n_2', tex_to_color_map=t2cm)
        eq2 = MathTex(r'\sum', '_1', '^n', 'x', '^2', '= n_2', tex_to_color_map=t2cm)

        font = {'font_size': 24}
        txts = [Text(sub.get_tex_string(), t2c=t2cm, **font) for sub in (eq1, eq2) for i in range(len(sub))]
        for txt in txts: 
            print(txt)
        txt1 = VGroup(*txts[:4])
        txt2 = VGroup(*txts[4:])

        cap1 = Text('tex rendered', **font)
        cap2 = Text('tex substrings', **font)
        
        grp = VGroup(cap1, cap2, eq1, txt1, eq2, txt2).arrange_in_grid(3,2)
        grp.scale(1).move_to(ORIGIN)
        self.add(grp)

On my computer it renders as shown here:

It seems like the strings "^n" and "_1" have been duplicated in the lower equation.

henrikmidtiby · 2025-11-06T22:39:21Z

Now I managed to find a case, where the new code seems to render the equation badly.

from manim import *

class MathTexUnexpectedBehaviour(Scene):
    def construct(self):
        t = MathTex("\\int^b{{_a}} dx = b - a")
        self.add(t)

        t[1].set_color(RED)

Which renders as

MathTexUnexpectedBehaviour_ManimCE_v0 19 0

Where I miss the upper limit of the integral.
The issue disappears if the limits of the integral are interchanged.

from manim import *

class MathTexUnexpectedBehaviour(Scene):
    def construct(self):
        t = MathTex("\\int{{_a}}^b dx = b - a")
        self.add(t)

        t[1].set_color(RED)

Nikhil172913832 · 2025-11-07T06:45:43Z

@henrikmidtiby,, addressing your first issue:

In the original line:

txts = [Text(sub.get_tex_string(), t2c=t2cm, **font) for sub in (eq1, eq2) for i in range(len(sub))]

the expression for sub in (eq1, eq2) iterates over the MathTex objects themselves rather than their submobjects. Meanwhile, for i in range(len(sub)) loops over the number of submobjects, but Text(sub.get_tex_string(), ...) still calls .get_tex_string() on the full object instead of each submobject.

A better approach would be:

from manim import *

class MinimalWithSumDifficult(Scene):
    def construct(self):
        t2cm = {r'\sum': BLUE, '^n': RED, '_1': GREEN, 'x': YELLOW}
        eq1 = MathTex(r'\sum', '^n', '_1', 'x', '^2', '= n_2', tex_to_color_map=t2cm)
        eq2 = MathTex(r'\sum', '_1', '^n', 'x', '^2', '= n_2', tex_to_color_map=t2cm)

        font = {'font_size': 24}
        # convert each submobject to Text, arrange horizontally
        def make_text_group(eq):
            txts = [Text(str(mob.tex_string), t2c=t2cm, **font) for mob in eq]
            for i, t in enumerate(txts):
                t.next_to(txts[i - 1], RIGHT, buff=0.1) if i > 0 else None
            return VGroup(*txts)

        txt1 = make_text_group(eq1)
        txt2 = make_text_group(eq2)

        cap1 = Text('TeX rendered', **font)
        cap2 = Text('TeX substrings', **font)

        grp = VGroup(
            cap1, cap2,
            eq1, txt1,
            eq2, txt2
        ).arrange_in_grid(rows=3, cols=2, buff=0.6)

        grp.move_to(ORIGIN)
        self.add(grp)

Which renders as:

manim/mobject/text/tex_mobject.py

…ing scripts, ensuring both subscript and superscript limits are displayed for integrals

for more information, see https://pre-commit.ci

Nikhil172913832 · 2025-11-07T07:09:12Z

@henrikmidtiby, addressing your second issue:

MathTex("\\int{{_a}}^b dx = b - a")

was missing the upper limit b because my code was incorrectly trying to pair \int^b with _a as a base+scripts group.

After looking into it, I found that the base element \int^b already contained its own superscript ^b, but my code didn’t detect this and still applied the base+scripts logic, consuming submobjects incorrectly.

I added a check to skip the base+scripts logic if the base element’s tex_string already contains ^ or _, indicating that it already has scripts attached.

Now:

from manim import *

class MathTexUnexpectedBehaviour(Scene):
    def construct(self):
        t = MathTex("\\int^b{{_a}} dx = b - a")
        self.add(t)
        t[1].set_color(RED)

renders as:

henrikmidtiby · 2025-11-08T20:02:30Z

@Nikhil172913832
Thanks for addressing the two issues.
This PR clearly improves how the tex_to_color_map option is handled, and makes it usable in more situations.

I used some time to search for an example where the PR would fail.
It took some time but eventually I found the following:

from manim import *

class MinimalWithSumVeryDifficult(Scene):
    def construct(self):
        t2cm = {r'\sum': BLUE, 'n_2': RED, '_1': GREEN}
        eq1 = MathTex(r'\sum^{n_2^3}_1', tex_to_color_map=t2cm)
        eq2 = MathTex(r'\sum_1^{n_2^3}', tex_to_color_map=t2cm)

        font = {'font_size': 24}
        # convert each submobject to Text, arrange horizontally
        def make_text_group(eq):
            txts = [Text(str(mob.tex_string), t2c=t2cm, **font) for mob in eq]
            for i, t in enumerate(txts):
                t.next_to(txts[i - 1], RIGHT, buff=0.1) if i > 0 else None
            return VGroup(*txts)

        txt1 = make_text_group(eq1)
        txt2 = make_text_group(eq2)

        cap1 = Text('TeX rendered', **font)
        cap2 = Text('TeX substrings', **font)

        grp = VGroup(
            cap1, cap2,
            eq1, txt1,
            eq2, txt2
        ).arrange_in_grid(rows=3, cols=2, buff=0.6)

        grp.move_to(ORIGIN)
        self.add(grp)

I don't know if it is possible to make this work for all potential cases, without reimplementing most parts of the external latex parser. I don't think that would be worth the effort though.

In search of an alternative, I managed to to find a post in the #dev-chat on the manim discord server, where Benjamin Hackl mentioned a potentially more stable approach some time ago.
https://discord.com/channels/581738731934056449/1023550532914266142/1406959019868029043

Benjamin Hackl — 8/18/25, 1:12 PM
I learned something completely insane yesterday, which stronly motivates completely rewriting Tex and friends. did any of you know that it is possible to insert commands in a given TeX code that are being picked up by dvisvgm? we can actually insert a bunch of <g id="manim-group-xyz"> </g> in the SVG produced by dvisvgm 👀

uwezi — 8/18/25, 2:23 PM
how? That sound quite useful!

Benjamin Hackl — 8/18/25, 2:27 PM
indeed, and i think it resolves all sort of TeX-splitting issues; from the bit of testing I did yesterday it actually seemed quite robust. And no need to artificially split TeX strings anywhere...

the command is simply
\special{dvisvgm:raw <g id="something-unique">}
...
\special{dvisvgm:raw </g>}

and after generating the corresponding svg via dvisvgm the glyphs resulting from the TeX code in between should be wrapped in a proper svg group with the given id. 👀

Nikhil172913832 · 2025-11-09T05:46:16Z

@henrikmidtiby Thanks for the detailed feedback and for sharing that Discord thread. I agree — handling every TeX edge case isn’t practical without deeper parsing. It makes sense to wait for the \special{dvisvgm:raw} approach rather than adding a temporary fix.

Fix ManimCommunity#3548: Match MathTex subscripts/superscripts by pos…

47c6720

…ition Use geometric matching for script elements to handle LaTeX reordering while preserving sequential matching for non-script elements.

github-project-automation bot added this to Dev Board Oct 25, 2025

github-project-automation bot moved this to 🆕 New in Dev Board Oct 25, 2025

[pre-commit.ci] auto fixes from pre-commit.com hooks

0c0e013

for more information, see https://pre-commit.ci

Nikhil172913832 added 2 commits November 1, 2025 10:01

Merge branch 'main' into fixing_issue_3548

928a200

Improve fix for ManimCommunity#3548: Group consecutive scripts and ma…

c135c02

…tch by Y position

[pre-commit.ci] auto fixes from pre-commit.com hooks

084b82f

for more information, see https://pre-commit.ci

Nikhil172913832 and others added 2 commits November 6, 2025 11:51

Fix the coloring schems to make sure it is consistent and refactored …

5e1070f

…_break_up_by_substrings function to improve understandability

[pre-commit.ci] auto fixes from pre-commit.com hooks

5d0c384

for more information, see https://pre-commit.ci

Merge branch 'main' into fixing_issue_3548

5e11f49

github-advanced-security bot found potential problems Nov 7, 2025

View reviewed changes

manim/mobject/text/tex_mobject.py Fixed Show fixed Hide fixed

Fix MathTex substring extraction to correctly handle bases with exist…

adc71b4

…ing scripts, ensuring both subscript and superscript limits are displayed for integrals

Nikhil172913832 force-pushed the fixing_issue_3548 branch from 6c84a90 to adc71b4 Compare November 7, 2025 07:02

[pre-commit.ci] auto fixes from pre-commit.com hooks

d8122d1

for more information, see https://pre-commit.ci

Merge branch 'main' into fixing_issue_3548

79d8a00

Fix #3548: Match MathTex subscripts/superscripts by position #4457

Are you sure you want to change the base?

Fix #3548: Match MathTex subscripts/superscripts by position #4457

Conversation

Nikhil172913832 commented Oct 25, 2025

Overview: What does this pull request change?

Motivation and Explanation: Why and how do your changes improve the library?

Links to added or changed documentation pages

Further Information and Comments

Reviewer Checklist

Uh oh!

henrikmidtiby commented Oct 31, 2025

Uh oh!

Nikhil172913832 commented Nov 1, 2025

Uh oh!

henrikmidtiby commented Nov 3, 2025

Uh oh!

Nikhil172913832 commented Nov 4, 2025

Uh oh!

henrikmidtiby commented Nov 4, 2025

Uh oh!

Nikhil172913832 commented Nov 6, 2025

Uh oh!

henrikmidtiby commented Nov 6, 2025

Uh oh!

henrikmidtiby commented Nov 6, 2025

Uh oh!

Nikhil172913832 commented Nov 7, 2025

Uh oh!

Uh oh!

Nikhil172913832 commented Nov 7, 2025

Uh oh!

henrikmidtiby commented Nov 8, 2025

Uh oh!

Nikhil172913832 commented Nov 9, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants